Channel-Optimized Quantum Error Correction 



Soraya Taghavi 

Department of Electrical Engineering and the Center for Quantum Information Science and Technology, 
University of Southern California, Los Angeles, CA 90089 

Robert L. Kosut 

SC Solutions, Systems & Control Division, 1261 Oakmead Pkwy., Sunnyvale, CA 94085 

Daniel A. Lidar 

Departments of Electrical Engineering, Chemistry, and Physics, 
and the Center for Quantum Information Science and Technology, 
University of Southern California, Los Angeles, CA 90089 

We develop a theory for finding quantum error correction (QEC) procedures which are optimized for given 
noise channels. Our theory accounts for uncertainties in the noise channel, against which our QEC procedures 
are robust. We demonstrate via numerical examples that our optimized QEC procedures always achieve a higher 
channel fidelity than the standard error correction method, which is agnostic about the specifics of the channel. 
This demonstrates the importance of channel characterization before QEC procedures are applied. Our main 
novel finding is that in the setting of a known noise channel the recovery ancillas are redundant for optimized 
quantum error correction. We show this using a general rank minimization heuristic and supporting numerical 
calculations. Therefore, one can further improve the fidelity by utilizing all the available ancillas in the encoding 
block. 



PACS numbers: 

I. INTRODUCTION 

Quantum error correction is often considered the backbone 
of quantum information processing, since it converts what is 
essentially an analog information processor, subject to a con- 
tinuum of errors, into a digital one, whose errors are dis- 
cretized. The theory of quantum error correction was devel- 
oped in analogy to classical coding for noisy channels [1-6]. 
These initial efforts focused on finding conditions and proce- 
dures for perfect recovery of quantum states passing through 
noisy channels. Recently, several authors considered error 
correction design as an optimization problem, with fidelity as 
the optimization target [7-10]. In this work we further de- 
velop the theory of optimal quantum error correction. As in 
[7-10], we consider the scenario where one has knowledge 
of the noise channel, and find correspondingly optimal codes. 
That is, we assume that one has already performed a channel 
identification procedure, e.g., via quantum process tomogra- 
phy [11]. We show how, armed with a knowledge of the chan- 
nel, one can design highly robust error correction procedures, 
whose fidelity is always at least as good as that of the "agnos- 
tic" codes of standard error correction [1-6]. 

More specifically, we present an indirect approach to fi- 
delity maximization based on minimizing the error between 
the actual channel and the desired channel. This approach, 
like the previously developed approach to direct fidelity opti- 
mization, leads naturally to bi-convex optimization problems, 
namely, two semide finite programs (SDPs) [12] which can be 
iterated between the recovery and encoding. For a given en- 
coding the problem is convex in the recovery. For a given 
recovery, the problem is convex in the encoding. An impor- 
tant advantage of this approach is that noisy channels, which 
do not satisfy the standard assumptions for perfect correction 



[1-6], can be optimized for the best possible encoding and 
recovery. 

The conventional fidelity optimization targets are the en- 
coding and recovery operators. An important way in which 
the present work differs from previous studies is in the fact 
that we further add the distribution of the ancillas in the en- 
coding and recovery to the optimization problem. This way, 
we utilize all possible degrees of freedom for optimization. 
As a consequence, we find a rather surprising result: in the 
optimized error correction procedure the fidelity is indifferent 
to the existence of the recovery ancillas. This result paves 
the way toward a more efficient utilization of the ancillas. 
Namely, we can use all the available ancilla qubits in the en- 
coding to increase the fidelity. 

Standard error correction schemes, as well as those pro- 
duced by the aforementioned optimization methods which are 
tuned to specific errors, are often not robust to even small 
changes in the error channel. These errors can be mitigated 
by fault-tolerant methods which rely on several levels of code 
concatenation [13]. However, our method naturally enjoys 
a desirable robustness against error variations. We show a 
means to incorporate specific models of error channel uncer- 
tainty, resulting in highly robust error correction. Neverthe- 
less, concatenated fault tolerant quantum error correction still 
enjoys a certain important advantage over the procedures we 
derive in this work, namely, it is robust also against imperfec- 
tions in the encoding and recovery procedures, while here, as 
is the case for standard quantum error correction formulations, 
we assume these to be perfectly executed. 

Since the number of optimization variables scales exponen- 
tially with the number of qubits used in the encoding and re- 
covery operations, the computational effort required to solve 
any of the semidefinite program optimization (SDP) problems 
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is similarly burdened. In order to reduce this effort we pro- 
pose an approach based on optimization via the constrained 
least squares method. This alternative approach for solving 
the indirect optimization problem does not utilize semidefi- 
nite programming, and is significantly faster in our numerical 
simulations. Surprisingly, this method returns the exact same 
result as the SDP approach. 

The organization of the paper is as follows. In Section II, 
we explain the problem formulation including the standard 
error correction model and state the direct and indirect op- 
timization problems to be addressed. The indirect approach 
is explored in Section III. In Section IV, we investigate the 
optimal distribution of the ancillas between the recovery and 
encoding. Examples of the methods presented are given in 
Section V. Appendices A-E provide proofs and supporting 
material. 



equivalently expressed, and consequently physically imple- 
mented, as a unitary with ancilla states [6, §8.23]. An example 
of this representation of the standard error correction model of 
Figure 1 is shown in the block diagram of Figure 2. 



PC 



a c 



Ps 
\0ca) 





i ; ► 




J-i ► 




U c 


', > > 


U E 




U R 








— r; — *~ 










\0r'a) — ► 












PB 




bath 





■PS 



FIG. 2: System-ancilla-bath representation of standard encoding 
error-recovery model of error correction. 



II. PROBLEM FORMULATION 

A. Standard error correction model 

Subject to the assumption that the initial system-bath state 
is classically correlated [14], the dynamics of an open quan- 
tum system can be represented in an elegant form known as 
the Kraus operator sum representation (OSR). In this repre- 
sentation, the noise £ is described in terms of a completely- 
positive (CP) map: p — > X^i^/ ^! Here p is the initial 
system density matrix and the operators Ai, known as Kraus 
operators, or operation elements, satisfy the normalization re- 
lation A\Ai = I (identity). The standard error correction 
procedure involves CP encoding (C), error (£), and recovery 

C £ K 



(7Z) maps (or channels): p s — > p c — > a c 
pictorially in the block diagram of Figure 1 . 
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FIG. 1: Standard representation of error correction. 

Here p s is the n s x n s system state, p c is the n c x n c en- 
coded state, <t c is the n c x n c perturbed encoded state, and 
p s is the n s x n s recovered system state. Using the OSR: 

PS = Y.iR^^PsiRrE.C^. 



(1) 



The encoding {Cc}).™^ and recovery {R r }™=i operation el- 
ements are rectangular matrices, respectively n c x n s and 
n s x n C' smce tne y ma P between the system Hilbert space 
of dimension ns and the system/ancillas Hilbert space, the 
codespace, of dimension nc- The error operation elements 
{-E'eleJi are square (n c x n c ) matrices, and represent the 
effects of noise on the codespace. The number of elements, 
m c , m E , m R depend on the manner of implementation and 
basis representation [6]. More specifically, any OSR can be 



In this case the encoding operation C is implemented by 
a unitary operator Uc acting on the (tensor) product of the 
system state, p s , and the encoding ancillas' state, |0ca), pro- 
ducing the encoded state p c = Uc(\Qca) (Oca\ ® Ps)^c- 
(The tensor ordering is arbitrary, but once established must 
remain fixed for consistency). If the encoding ancillas' state 
has dimension n CA , then the resulting codespace has dimen- 
sion n c = n s n CA . If, as is customary, we take |0cm) as tne 
n CA -column vector with a one in the first element and zeros 
elsewhere (i.e., it is a tensor product of log 2 ncA encoding 
ancillas, each in the state |0) = (1, 0)'), then the OSR for C 
has the single (m c = 1) n c x n s matrix element C whose 
columns are the first n s columns of Uc, thus forming a set of 
orthonormal codewords, i.e., 



U C = [C ■••], Cis; 



(2) 



For the errors, £, the ancillas' states are not implemented 
by design, but rather, engendered by interaction with the bath, 
a term used to generically describe the physical environment. 
The error operation is thus equivalent to the unitary U e oper- 
ating on the tensor product of p c , the encoded state, and ps, 
the bath state. The number of bath states may be very large, 
in principal infinite dimensional. However, it is always possi- 
ble to represent £ with a finite number of OSR elements with 
m E < n c 2 [6, Thm.8.3]. 

Finally, the recovery operation 1Z can be implemented via 
the unitary Ur operating on the (tensor) product of the per- 
turbed encoded state, er c , and the (additional) recovery ancil- 
las' state |0ha)- If lOi?^) is an n^-column vector with a 
one in the first element and zeros elsewhere, then the OSR 



{-Rr} r J\ for 1Z has m R = n CA n RA 
of the first n c columns of Ur, i.e., 

Ri 



U R = [R---],R = 



Rr, 



elements which consist 



R r is n s x n c (3) 



The model represented in Figures 1 and 2 assumes that the 
encoding and recovery operations can be implemented much 
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TABLE I: Definitions of some frequently used symbols. 



Symbol 


Definition 


ns 


dimension of the system space 


ncA 


dimension of the encoding ancillas space 


nc 


dimension of the (system + encoding ancillas) space, i.e., nc = ns X ncA 


riRA 


dimension of the recovery ancillas space 


m E 


number of operation elements for error map 


m R 


number of operation elements for recovery map 



faster than relevant time-scales associated with the bath. For a 
detailed discussion of the validity of such a Markovian model 
see [18]. Nevertheless, we will assume the model of Figures 
1 and 2 for the remainder of this work, as complications asso- 



ciated with the bath being "on" during encoding and recovery 
are likely to be dealt with via fault tolerance methods [19], 
which require a base level of encoding of the type we find 
here. 



Table I provides definitions of some frequently used symbols. 



B. Performance measures 



Assume that we are given the OSR elements of the error 
channel £. This could be obtained, for example, from the out- 
put of a quantum process tomography experiment [11]. The 
error correction objective considered here is to design the en- 
coding C and the recovery 1Z so that, for a given error opera- 
tion £, the map ps — ► ps is as close as possible to a desired 
n s x n s unitary logic gate L$. Common measures of per- 
formance between two quantum channels are typically based 
on fidelity or distance [6], [20], [21], [22]. Here we will use 
the channel fidelity [7] between the error correction operation 
7Z£C and the desired operation L$'. 

/ = V |Tr L\R r E e C c \ 2 . (4) 

* r,e,c 

where < / < 1 and from [5], [6, Thm.8.2], / = 1 if and 
only if there are constants 5 rec such that, 

R r E e C c = 6 rec Ls, 2^ \$rec\ 2 = 1- (5) 

r,e,c 

This suggests the indirect measure of fidelity, the "distance- 
like" error (using the Frobenius norm, = Tr X'X), 

d = J2r,e,c \\RrE e C c — 5 rec L S \\p 

= E c \\RE(I E ®C c )-A c ®L s f F 

where 

A c = [5 rec ], dimA c = m R xm E , (7) 
and E is the n c x n c m E rectangular "error matrix," 

E=[E 1 ■■■ E m J, (8) 



R is the m R n s x n c matrix obtained by stacking the m R 
matrices R r as in (3), and Ie is the m E x m E identity. Hence, 
we have £ c Tr AjA c = £ r , e , c I'M 2 - 1, and R^R = 

J2 r R\Rr = Ic- 

We show in Appendix A that there exists a recovery and 
encoding pair, 1Z,C, which achieves perfect error correction 
(equivalently d = 0, / = 1), iff for c, d = 1, . . . , m c 

{I e ®CI)E^E{Ie®C c .)=A\A c .®I s (9) 

This is a generalization to non-unitary CP encoding of the 
Knill-Lafiamme condition for perfect error correction with 
unitary encoding [5]. In this latter case, C has only a single 
n c x n s matrix element C, C^C = Is, whose n s columns 
are the codewords. As / and d are explicitly dependent on the 
channel elements, they are convenient for optimization. Con- 
sider then the following optimization problems. 

Direct Fidelity Maximization 
maximize f(R, C) (10) 
subject to R^R = I Cl &C = Is 

Indirect Fidelity Maximization 
minimize d(R, C, Ai, . . . , A mc ) 
subject to R^R = I C , &C = I S , E c ll A c||! = l 

(11) 

Here C is the n c x n s matrix obtained in (2) . The direct ap- 
proach was used in [7, 9, 10, 15-17]. As shown in Appendix 
B, / and d are related as follows: 

f(R,C) = (l-d(R,C)/2n s y 

d(R,C) = mm{d(R,C,Ax,...,A mc | ||A C ||£ = 1, Vc } 

(12) 

This shows that minimizing the distance (11) is equivalent to 
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maximizing fidelity (10). 



III. INDIRECT FIDELITY MAXIMIZATION 



C. Robust error correction 

An important advantage of the method presented here is 
that unlike the standard error correction model, it accounts for 
uncertainty in knowledge of the channel. Such uncertainty 
may exist for many reasons. For example, different runs of 
a tomography experiment can yield different error channels 
{£a}a=i- Or, a physical model of the error channel might 
be generated by a Hamiltonian H(0) dependent upon an un- 
certain set of parameters 9. In any case, not accounting for 
the uncertainties typically leads to non-robust error correc- 
tion, in the sense that a small change in the error model can 
lead to poor performance of the error correction procedure. 
One way to account for these Hamiltonian parametric uncer- 
tainties is to take a sample from the set of Hamiltonians, say, 
{H(9 a )} £ a=1 . Tracing out the rat; bath states will result in a 
set of error channels {£ a }a=i where each error channel has 
OSR elements {E a ^}% =1 , where n is the largest of the num- 
ber of OSR elements in each sample. In those samples with a 
smaller number we can set the corresponding OSR elements 
to zero. 

Two standard measures of robustness are the average-case 
and worst-case. For the average-case, suppose that each OSR 
set £ a is known to occur with probability p a . Then define the 
average error channel by the OSR, 



£ avg = { V^£ Q , fe |a = l,. 



1,. 



} (13) 



The average error channel in this form has k£ OSR elements, 
potentialy a very large number. However, this number is read- 



ily reduced to no more than m E 



using a singular value 



decompostion [6, Thm.8.3]. Associated with £ avg is the aver- 
age channel fidelity, 



r 



- ^Pafa - 2 



E 



Tr RrE^Q 



(14) 



1. 



Ik are the OSR elements of £ avg in 



where E^7 g , 
(13). 

For average-case robust error correction we replace / in 
(10) with / avg in (14), and using the relationship (12), replace 
din (11) with, 



d avg = \\RE av £(I E ®C) 

£avg = [£avg . . . ^avgj 



A(g)Lc 



(15) 



A similar formulation exists for worst-case error correction 
which was considered in [10]; we do not consider it any fur- 
ther here. The remainder of the paper concentrates on the 
average-case objective and development of the associated op- 
timization algorithms. The examples presented in Sec.V show 
that this approach yields a high degree of robustness to uncer- 
tainty in the optimal codes. 

We now discuss methods to approximately solve (obtain lo- 
cal solutions to) the indirect optimization problem (11). 



We consider the encoding operator C as a unitary operator 
acting on both the encoding ancillas and the input qubit. Using 
the constraints in (11), we can express the distance measure 
(6) as 



d(R,C,A) = \\RE(I E ®C)-A®L s \\l 
= n s + Tr E(I E ® CC ] )E^ - 2Re Tri?S(A f 



(16) 

•CL\) 



where A is the single tub X m E matrix in (7) with tur = 
n-CAnRA (note that in this case, since there is only a single 
A c matrix, we drop the subscript c). 



A. Optimal Recovery 



Since only the last term in (16) depends on the recovery 
matrix R, minimizing d(R, C, A) with respect to R is equiv- 
alent to maximizing the last term. In Appendix A, we show 
that this maximization results in 



max Re Tr i? J B(A t <g> CL\) = Tr yj E(T ® CC^)E^, 

(17) 

where the m E x m E matrix T is defined as, 



r = A f A, 



(18) 



and the associated ncn>RA x n c optimal recovery matrix is, 



R = [vi ... v nc ][ui 



It 



(19) 



where Vi, Ui, i = 1, . . . , nc are, respectively, the right and 
left singular vectors in the singular value decomposition of the 
nc x ncnjiA matrix E(A 1f (g) CLg), with the singular values, 
as usual, in descending order. Thus, to obtain the optimal 
recovery, we need first to find T which maximizes (17) - this 
is equivalent to minimizing d over R. Following this we need 
to determine A satisfying (18). 

To find T, observe that T > by definition (18), and the 
constraint ||A||f = 1 from (11) is equivalent to Tr T = 1. 
Hence, optimal recovery can be obtained by first solving for 
r from, 



maximize Tr^^T ® CC^)E^ 
subject to T > 0, Tr T = 1 



(20) 



In Appendix D it is shown that the optimal T is the solution of 
an equivalent SDR 

The next step is to use (18) to obtain A from T. The fol- 
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lowing choice adheres to the given dimensions: 



n 

n Di n 



CA < m E 



RA CA 



RA 



A = Vr 

R is tall (n s m E x n c ) 

Vr 

On CA -m E xm E 

R is unitary (n c x n c ) 



(21) 

Clearly the choice of A is not unique. In fact, the result does 
not change if A is multiplied by a unitary, i.e., A — > U A. This 
is exactly the unitary freedom in choosing the OSR elements 
[6]. Interestingly, however, from many numerical calculations 
we observe that the following holds: 



rank(r) = n C A if n C A < m E 
rank(r) — m E if n C A > m E . 



(22) 



Since the m E x matrix T is Hermitian (= A^A), and A 
n CA n RA , it follows that if (22) is 



is m R x m E with m R 
true then, 



riRA = 1. 



(23) 



If, in the optimized error correction, the dimension of the re- 
covery ancillas space is one, then the optimal recovery matrix 
R is always a unitary - recovery ancillas are redundant in 
maximizing the fidelity. Note that we started with a generic 
n R A parameter, and the properties of the optimal solution led 
us to the above conclusion. Although we do not have a rig- 
orous proof that the recovery ancillas are redundant, a com- 
pelling heuristic argument is offered in Section IV along with 
supporting numerical results. 



B. Optimal Encoding 

For a given R and A, the optimal encoding C can be found 
by solving (11) for C, that is, 

minimize d(R, C, A) = \\RE(I E <X> C) - A <g> L s ||| 
subject to C^C = Is 

(24) 

As shown in Appendix E, the optimal encoding C is given by, 



where (U, V) are obtained from the SVD, 

C = Y^ S re {R r E e )^L s = USV^ 



with U an 



x fin matrix with orthonormal columns, i.e., 



UW = Is, V an n s X n s unitary, and S a diagonal matrix 
of the n s singular values. The matrix C is the unconstrained 
(least-squares) solution to (24), i.e., mine d. 

The left-hand column of Table II, labeled Algorithm- 1, 
summarizes the preceding method for recovery and encoding 
optimization. For optimal recovery alone, solve (20) for Y, 
then determine A via (21), and finally R from (19). For op- 
timal encoding alone, solve (24) for C. To find a combined 
optimal encoding and recovery repeat steps 1 and 2 in Table II 
until d stops decreasing. (By virtue of (12), fidelity increases 
in every step). Since in each step the distance measure, d, can 
only decrease, never increase, the converged solution to the 
combined optimization is only guaranteed to be a local opti- 
mal solution to (11). 



C. Alternative iterative algorithm for recovery optimization 



An alternative to the above optimal recovery procedure 
(Step 1 in Algorithm- 1 of Table II) is to iterate between solving 
(11) directly by minimizing over A and then using (19) to find 
R. Specifically, for a given R and C, Step 2a in Algorithm- 
2of Table II requires solving the following constrained least- 
squares problem for A: 

minimize d(R, C, A) = \\RE(I E ® C) - Ag)L s ||| 
subject to || A||p = 1 

(26) 

As shown in Appendix B, the solution is, 



A = A/||A|| F 
(A) re = Tr(RrE e CL s )/i 



(27) 



s- 



C = UV^ 



(25) 



where A is the unconstrained (least squares) solution to 
minA d. This solution is then used in (19) to find R (Step lb), 
then back to (27) (Step la), and so on until d stops decreasing 
(Step lc). 



The difference between the two algorithms is in computing the 
optimal recovery (Steps 1). In Step 1 of Algorithm- 1, no itera- 
tions are required; the optimal recovery is achieved by solving 
the SDP (20). For Step 1 of Algorithm-2, an optimal recov- 
ery is the result of some number of iterations involving the 
constrained least-squares problem (26). Although at present a 
proof is not available, in every case we have tried the optimal 



fidelity in both recovery algorithms converges to the same re- 
sult. Additionaly, the total CPU-time in MATLAB to compute 
the optimal recovery in Algorithm-2 (including the iterations) 
is significantly less than the CPU-time for the recovery step 
in Algorithm- 1 using YALMIP [28] to call the solver SDPT3 
[29]. 



6 



TABLE II: Iterative Algorithms for Optimal QEC 



Algorithm- 1 


Algorithm-2 


Initialize R, and C 


Initialize R and C 


Repeat 


Repeat 


1 . Optimal recovery 


1 . Optimal recovery - Repeat a-c 


(a) solve (20) for V 


(a) solve (26) for A 


(b) T -> A via (21) 


(b) A -> i? via (19) 


(c) A ->■ R via (19) 


(c) Until d stops decreasing 


2. Optimal encoding 


2. Optimal encoding 


(a) solve (24) for C 


(a) solve (24) for C 


Until d stops deceasing 


Until <i stops deceasing 



IV. DIMENSION OF THE RECOVERY ANCILLAS SPACE 

In our formalism, the dimension of the Recovery ancillas' 
space, i.e., the required number of recovery ancilla qubits, is 
determined by the rank of the vtle X uie matrix T. 

A. Rank minimization of T 

In this section, we study the rank of T through a heuristic 
argument by noting the similarity between our problem and 
the so called "Rank Minimization Problem" (RMP) [24]: 



minimize rank (X) 
subject to X G X 



(28) 



The matrix X is the optimization variable and A" is a convex 
set denoting the constraints. 

Although several special cases of the RMP have well- 
known solutions, in general the RMP is known to be computa- 
tionally intractable. However, there are a number of heuristic 
approaches to solving this problem. Restate (20) as follows, 



minimize 



Tr(r) 



subject to T > 0, TYyjE{T ®C0)E^ > const. 



(29) 



where the constant is the maximum which arose in (20). A 
well known heuristic for RMP when X is positive semidefi- 
nite [25-27] is to replace the rank objective with Tr [X] and 
solve, 



minimize Tr [X] 
subject to X G X, X > 



(30) 



By comparing (29) with (30), we can view our problem in (20) 
as an RMP that minimizes the rank of T. Thus, the rank of the 
optimal T is the smallest possible consistent with not changing 
the rank of our objective matrix, y/ E(T (£) CC^)E^ . Noting 



that rank(CCt) = ns and rank(iJ) = ric and with a straight- 
forward linear algebra analysis we find that this property holds 
if 



if n C A < m E 
I rank(T) = tue if tt-ca > rn-E- 



f rank(T) > n C A 



(31) 



is. 



in the first 



That 

n C A, rank( ^ E(T 



case, if rank(T) < 
7XX)Ef) decreases by de- 
creasing the rank of T. But if rank(T) > uca, 
Tank(y/E(T <g) CC^)E^) = n c , and it does not depend 
on rank(r). In the second case, T should be full rank. 
Therefore the rank of the optimal V is 



J rank(r opt ) = n C A if n C A < m E 
|rank(r opt ) = m E if n CA > m E , 



(32) 



which agrees with (22). Note that the same argument also 
applies in the average case (15) with E replaced by i? avg . 



B. Numerical result for randomly generated error maps 



Here, we examine the result above for randomly generated 
error maps. Namely, we find the rank of the optimal T for each 
random map by applying the indirect optimization method. 
The error map is modeled as shown in Fig. 2 as a unitary U e 
acting on the joint codespace-bath Hilbert space. The unitary 
Ue arises from a randomly selected m^nc x m^^c time- 
independent Hamiltonian He, i-e., Ue — e~ ttHE (we work in 
units where h = 1). The unitary evolution operator generated 
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by this Hamiltonian at time t = 1 is 



U E = exp(-iH B ) = 



E-2 



(33) 



That is, we pick the first rip columns of the matrix Ue- Here, 
Ei . . . E m are the nc x "c OSR elements of the error opera- 
tion, and from (8), E = [Ei ■ ■ ■ E„ 1e \. 

Figure 3 presents the channel fidelity vs. the number of it- 
erations in Algorithm 1 for 100 random error maps. In this 
experiment, the system is a single qubit and one qubit is used 
as an encoding ancilla, i.e., ns = 2, uqa = 2. Each error 
map has 4 OSR elements, i.e., tue = 4, and is generated us- 
ing a 16 x 16 random Hamiltonian matrix according to (33). 
Therefore, the matrix T in (20) is 4 x 4. Figure 4 shows the 
histogram of the rank of V vs. the number of iterations. This 
histogram indicates that after 20 iterations in the optimiza- 
tion algorithm, the rank of T is always two, which is equal to 
Uca- In fact, those V that are not rank 2 after 10 iterations are 
associated to the error maps with lower rate of fidelity conver- 
gence. 
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FIG. 4: Rank of the optimal T for random error maps on two-qubit 
codes. 
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singular Value index 



FIG. 5: Singular values of the optimal T for random error maps on 
two-qubit codes. For all cases tested only two of the singular values 
are significantly different from zero, meaning that the rank of the T 
matrices is 2. 



number of iterations 



FIG. 3: Channel fidelity / for random error maps on two-qubit codes. 



Figure 5, which shows the singular values of the same T 
matrices, is included for comparison of the magnitude of the 
singular values. In all cases, the nonzero singular values are 
of the order of 10 _1 . The numerical precision of all the re- 
sults is 10~ 8 . We repeated the experiment for more than 1000 
random maps with different dimensions (only 100 are shown), 
and the result holds for all of them. Namely, after sufficiently 
many iterations in Algorithm 1, the rank of the optimal T is 
the same as the dimension of the encoding ancillas space, i.e., 
rank (T opt ) = n C A- 



V. EXAMPLES 

We now apply the methods developed above to the goal 
of preserving a single qubit (n s = 2) using a qc-qubit 
{n c = 2 qc ) codespace. In these examples, the error chan- 
nel £ consists of single-qubit errors occurring independently 
on all qubits with probability p. We examine two cases of 
bit-flip and bit-phase-flip errors. 



A. 3-qubit bit-flip errors 

In this example, we consider the independently occurring 
bit-flip error as the noise channel, where the bit-flip operator 
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FIG. 6: Channel fidelity / vs. bit-flip probability p for 3-qubit en- FIG. 7: Channel fidelity / vs. bit-flip probability p for 3-qubit en- 
coding, coding in two ranges: p < 0.5 and 0.5 < p < 0.9. 



is X = I y M . We used = 2 encoding ancilla qubits. 
There are 2 3 = 8 OSR error elements for 3-qubit encoding: 

{-E»}i=i = a h ® A i* ®A i3 , i!,h,i 3 G {1,2} 

^i = VTl -P) 7 (no error) (34) 
A-2 = \fp X (bit — flip error) 

Figure 6 shows / vs. bit-flip probability p in the range 
p < 0.9 for the standard 3-qubit code, optimal recovery at 
each p, average-case recovery over the p range, and no recov- 
ery. For the average case, we computed an optimized encod- 
ing and recovery for the single channel obtained by averaging 
over the error channels corresponding to p = 0, 0.1, ...,0.9 as 
defined in (15). We then applied this encoding and recovery 
to each of these 10 channels, thus producing the 10 fidelity 
values shown. Note that the optimal recovery can be achieved 
equivalently by either the constrained least squares method 
( Algorithm-2) or the convex optimization method (Algorithm- 
1). Interestingly, the standard 3-qubit code not only pro- 
vides optimal recovery for the range p < 0.5, it is optimal 
for both recovery and encoding in this range. For p > 0.5 
the standard code is clearly no longer optimal. Only in this 
range does the optimal recovery outperform the standard code, 
a phenomenon similar to what was reported for amplitude- 
damping errors in [7]. Analysis of our optimal encoding re- 
covery results reveals the following simple picture. The opti- 
mal code is the standard 3-qubit code for the entire p range, 
i.e., |0) = 1 000) and |1) = |111). The optimal recovery is 
the standard recovery [6] in the range < p < 0.5. In the 
range 0.5 < p < 0.8 the optimal recovery is a bit-flip on all 
qubits followed by the standard recovery. 

Figure 7 shows channel fidelity / in two ranges: p < 0.5 
and 0.5 < p < 0.9. Unlike the previous case, here we 
compute the optimization twice, once for each range. For 
the average case, we computed an optimized encoding and 



recovery for the single channel obtained by averaging over 
the error channels corresponding to p = 0,0.1, ...,0.4. We 
then applied this encoding and recovery to each of these 5 
channels, thus producing the 5 fidelity values shown in the 
range < p < 0.5. We then repeated this procedure for 
p = 0.5, 0.6, 0.9. For p < 0.5, the standard, optimal, 
average-case, all coincide. For p > 0.5, the optimal and 
average-case codes coincide and divert again from the stan- 
dard. The optimal encoding and recovery are the same as in 
Figure 6, i.e., the standard 3-qubit code, with standard recov- 
ery in the range < p < 0.5, and bit-flips preceeding standard 
recovery in the range 0.5 < p < 0.9. We conclude from the 
examples in Figures 6 and 7 that optimal encoding and recov- 
ery has no advantage over standard encoding and recovery for 
low bit-flip probabilities (p < 0.5), and thus increasing the 
codespace would be required to improve fidelity. For large 
errors (p > 0.5), optimization is more effective in that it iden- 
tifies an optimal recovery. In both cases the achieved opti- 
mal fidelity is independent of the number of recovery ancillas 
used, hence in all examples shown in Figures 6 and 7 there are 
no additional recovery ancillas required. It is striking that the 
average case fidelity matches the optimal in Figure 7, but not 
in Figure 6. This is entirely due to the range of p values over 
which the average is performed. The lesson is that the more 
information is available about the noise channel, the more ro- 
bust the encoding and recovery will be: in Figure 7 we know 
that the probability is in the range [0, 0.5] or [0.5, 0.9], while 
in Figure 6 we only know that it is in the range [0, 0.9] . Absent 
such information, robustness may still be attainable by exper- 
imenting with tuning the encoding and recovery over a range 
of channels. 
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FIG. 8: Channel fidelity / vs. bit-phase flip probability p for 3 qubit 
code and 0, 1, or 2 recovery ancillas, with optimal encoding and 
recovery. 



FIG. 9: Channel fidelity / vs. bit-phase flip probability p with a fixed 
total of 6 ancillas, and optimal encoding and recovery. 



B. Bit-Phase flip error 

In this example, the noise channel consists of bit-phase flip 
errors Y = \ 9 ~ ! ' 1 occurring independently with probability 

p. We do not allow for more than three to occur simultane- 
ously (i.e., we consider weight-3 errors). We examine two 
cases: 1. Considering a fixed number of encoding ancillas, 
we compare the fidelity using different numbers of recovery 
ancillas. 2. We fix the total number of available ancilla qubits, 
and compare the fidelity for various distributions of encoding 
and recovery ancillas. 



1. 5 -qubit bit-phase flip error 

In this example, the bit-phase flip errors occur indepen- 
dently on the input qubit and 4 ancillas. There are 26 error 
OSR elements: 1 for no error, 5 for a single error, 10 for dou- 
ble errors, and 10 for triple errors. Thus the matrix T in (20) is 
26 x 26 and the rank of r opt is equal to tica = 16, meaning 
that the optimal distribution of ancillas is having all four in the 
encoding block and none in the recovery block. 

Figure 8 shows / vs. bit-phase flip error probability p for 
the optimal encoding/recovery in the case of zero, one and 
two recovery ancillas. The result shows that all cases yield 
the same fidelity. Therefore, the fidelity of the system is inde- 
pendent of the number of recovery ancillas. 



2. bit-phase flip errors with a fixed number of ancillas 

In this example, we consider six ancilla qubits that can be 
used either in the encoding block or in the recovery block. 
We compare the fidelity for the following distributions: four 



encoding ancillas and two recovery ancillas, five encoding an- 
cillas and one recovery ancilla, and six encoding ancillas with 
no recovery ancilla. Figure 9 shows that the channel fidelity 
increases significantly by using the ancillas in the encoding 
instead of the recovery. Thus the most efficient use of ancillas 
is achieved when they are all used for encoding. 

VI. CONCLUSION 

We have presented an optimization approach to quantum 
error correction that yields codes which achieve robust per- 
formance, when tuned to a specific noise channel. An impor- 
tant aspect of developing optimal codes which are tuned to 
a class of errors, or are robust over a range of errors, is that 
the optimized performance levels may be sufficient for the in- 
tended purposes. Hence, no further increases in codespace 
dimension may be necessary. This cannot be known without 
performing the optimization. 

We also showed that the fidelity of such a system is inde- 
pendent of the number of the recovery ancillas. This is en- 
tirely due to the structure of the error correction optimization 
problem, for which we found that a unitary recovery operator 
maximizes the fidelity of the system. However, the fidelity 
increases significantly by increasing the dimension of the en- 
coding ancillas space. Therefore, in the optimal quantum er- 
ror correction scheme, one should use all the available ancilla 
qubits in the encoding block. 

Although not further developed here, the resulting codes, 
unlike standard codes, have support over all basis states. Some 
of the recovery structure is revealed via the indirect approach. 
This in turn leads to a method for approximating optimal re- 
covery involving only a singular value decomposition, making 
it potentially useful in evaluating very large blocks of encod- 
ing to see if further performance improvement is possible. 

We stress that there is an important difference between the 
standard error correction schemes [1-6] and the approach pre- 
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sented here. While in the standard case only the class of errors 
should be known, in our method the exact form of the noise 
map is required for optimization. In general, the noise map 
can be identified using quantum process tomography [11]. In 
most cases this extra knowledge is equivalent to identifying 
the probability of the error, which can also be found using our 
method. In order to identify the probability in a particular er- 
ror model, one should calibrate the fidelity of the system using 
a fixed pair of recovery and encoding operators. Once the re- 
lation between the fidelity associated to this pair and the error 
probability is known, a measurement of the fidelity yields the 
probability. 

It thus appears that the effectiveness of optimization is de- 
pendent upon the structure of the error operation, a result 
seemingly heralded by Feynman [30]: 

"In a machine such as this there are very many other 
problems due to imperfections. . . . there may be small 
terms in the Hamiltonian besides the ones we've writ- 
ten. ... At least some of these problems can be reme- 
died in the usual way by techniques such as error cor- 
recting codes . . . But until we find a specific imple- 



mentation for this computer, I do not know how to pro- 
ceed to analyze these effects. However, it appears that 
they would be very important in practice. This com- 
puter seems to be very delicate and these imperfections 
may produce considerable havoc." 

Determining the "specific implementation" is currently an 
on-going research effort. Analyzing the "effects" however 
will undoubtedly be accomplished by a combination of phys- 
ical modeling and/or system identification (e.g., process to- 
mography and parameter estimation). This leads to an intrigu- 
ing prospect: to integrate the results found here within a com- 
plete "black-box" error correction scheme, that takes quantum 
state or process tomography as input and iterates until it finds 
an optimal error correcting encoding and recovery. 
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APPENDIX A: PROOF OF EQUATIONS (9), (17), (19) 

The n c x n c n RA matrix W — E(A' (g)C) has a maximum 
rank of n c . Hence a singular value decomposition is of the 
form W = USV^ , S = [So 0] , with 5 an n c x n c diagonal 
matrix containing the n c singular values. If V is partitioned 
as V = [Vi V2] with V\ being n RA n c x n c then the objective 
function in (17) becomes, 

Re Tr RW = Re Tr S X, X = V?RU (Al) 
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Since \\X\\ < 1, then Rc TV S X < Tr S . Equality occurs 
if and only if X = Ic, or equivalently, R = V\U\ which 
is precisely the result in (19). This also establishes that the 
optimal objective function is Tr So which, by definition, is 
equal to TrVWW^ thus 



max RcTr RW 

RlR=I c 



TrVWW 



(A2) 



which establishes (17). 

Condition (9) follows directly from (5) by multiplying both 
sides by their respective conjugate (with indices c and c') 
which also eliminates R because R^R = Ic- This imme- 
diately establishes that (9) is a necessary condition for (5). To 
prove sufficiency, first expand (6) to get, 

d = J2 c Tr(I E ® Cl)E^E(I E ® C c ) 

+Tr r c ® I s - 2ReTf RE{A\ ® C C L\) (A3) 

r c = AtA c 



From (A2), we get, 



W 



EjTr (I E ® Cj)E* E{I E g C c ) 
+Tr T c ® I s ] - 2lWWWt 
£ c i?(At (g) C c 4) 



Using (9) we get, TrW^W^t 



(A4) 

£ c £(J B ® C c C c t)Et. This, 



together with repeated uses of (9) shows that min R \ R=Ic d = 
0. Since d is a norm, and is zero, then so is its argument, 
which by definition establishes (5) and thus shows sufficiency 
of (9). ' 



APPENDIX B: RELATION BETWEEN FIDELITY / AND 
DISTANCE d 



The problem is, 

minimize d = J2 C \\RE(I E ® G c ) - A c ® L s |i 
subject to J2 C \\ A c\\l = 1 

Form the Lagrangian, 

L = d + \{Y J Tr AjAc-1) 



(Bl) 



(B2) 



with A the Lagrange multiplier. Then, V5 T . cc L = 
when (n s + A)5 rec = Tr R r E e C c L^ s . To enforce the 
constraint ^ c ||A C ||| = 1 requires that (n s + A) 2 = 
J2 rec \T±RrE e C c Ll\ 2 . Hence, 



Ac = A c /^Ecl|Ac||| 
S rec = Tr{R r E e C c Ll)/n t 



(B3) 



Observe that J2c ll A c|j| = /■ This together with 
J2 r R\R r = Ic, C\C C = Is gives the optimal distance 
as given implicitly by (12). Note also that with no constraint, 



A = 0, the A c are the optimal least-squares (unconstrained) 
solution. 



APPENDIX C: UNITARY FREEDOM IN EQUATION (17) 

In (17), r = At A remains unchanged if A is multiplied by 
a unitary. This unitary freedom is exactly the unitary freedom 
in describing the error map OSR. To see this, recall again from 
[6, Thm.8.2] that two error maps with OSR elements E = 
[Ei . . . E mE ] and F = [Fi . . . F mE ] are equivalent if and only 
if Ei = J2j WijFj where the m E x m E matrix W is unitary. 
Equivalently from (A3), E = F(W ® I c )- Substituting this 
for E into the left hand side of (17) gives, 



Re Tr i?£(A T ® CL\) = Re Tr i?F((A') T ® CL\) (CI) 

with A' = AWt. Hence, A'*A' = WA^AW^ = T, which 
establishes the claim. 



APPENDIX D: SOLVING EQUATION (20) VIA AN SDP 
Problem (20) is of the form, 

(Dl) 



maximize Try/ F(T) 
subject to T > 0, Tr T = 1 



where F(T) is linear in V. Consider the relaxed problem, 
maximize Tr Y 



subject to F(T) - Y 2 > 0, T > 0, Tr T = 1 
This is an SDP in T and Y with Lagrangian, 

L{T,Y,P,Z) = -TrY -Tr P(F(T)-Y 2 ) 



Tr ZT + A(Tr T - 1) 



(D2) 



(D3) 



The dual function is, 



g(P,\,Z) = in£ r , Y L(T,Y,P,Z) 

[ infy Tr(PF 2 —Y) — X, Z = XI - A(P) 
1 — oo otherwise 

(D4) 

with A(P) = |rTr PF(T), which is not dependent on T 
because F(T) is linear in T. Performing the indicated infy 
gives Y = {l/tyP- 1 and g = -(A + (l/4)Tr p- 1 ). The 
dual optimization associated with (D2) is to maximize g, or 
equivalently, minimize its negative, i.e., 

minimize A + ^TrP -1 __. 

(D5) 

subject to P > 0, XI - A(P) > 

This is an SDP in the dual variables P, X. For this prob- 
lem strong duality holds [12]. Consequently, at optimality 
of (D2) and (D5) the complementary slackness condition is 

Po P t(-F(r o p t ) - r o 2 pt ) = 0. Since P opt > 0, we have 
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y opt = y/F(T p t ). This establishes that solving the SDP 
(D2) is equivalent to solving the original problem (Dl). 

APPENDIX E: SOLVING FOR C IN EQUATION (24) 

The problem is, 

minimize d = £) c \\RE(I E ® C) - A ® L s ||| 
subject to CtC = I s 

Form the Lagrangian, 

L = d+TrP(C t C'-/ s ) (E2) 



with P the Lagrange multiplier. Then, VcL = when 
C = C(I S + P)- 1 with C as defined in (24). To enforce 
the constraint C^C = Is requires that (I s + P) 2 = C^C. 
Hence, C = C(C^C)~ 1/2 . The actual computation of C is 
done using the SVD (25)-(26). Note that with no constraint, 
P = 0, and C is the optimal least-squares (unconstrained) 
solution. 



